Phonological constraint induction in a connectionist network: learning OCP-Place constraints from data

نویسندگان

  • Colin Wilson
  • John Alderete
چکیده

A significant problem in computational language learning is that of inferring the content of well-formedness constraints from input data. In this article, we approach the constraint induction problem as the gradual adjustment of subsymbolic constraints in a connectionist network. In particular, we develop a multi-layer feed-forward network that learns the constraints that underlie restrictions against homorganic consonants, or ‘OCP-Place constraints’, in Arabic roots. The network is trained using standard learning procedures in connection science with a representative sample of Arabic roots. The trained network is shown to classify actual and novel Arabic roots in ways that are qualitatively parallel to a psycholinguistic study of Arabic. Statistical analysis of network behavior also shows that activations of nodes in the hidden layer correspond well with violations of symbolic well-formedness constraints familiar from generative phonology. In sum, it is shown that at least some constraints operative in phonotactic grammar can be learned from data and do not have to be stipulated in advance of learning.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computational Modelling of Sound Pattern Acquisition

Adam Albright (MIT) Well-formedness across the word: Modeling markedness interactions 9:30-10 Vsevolod Kapatsinski (U. Oregon) Humans and models learning palatalization patterns in miniature artificial languages: In support of particular salience of typical product characteristics 10-10:30 Jeff Heinz (U. Delaware) Learning gradient long-distance phonotactics by estimating strictly piecewise dis...

متن کامل

Nonparametric Learning of Phonological Constraints in Optimality Theory

We present a method to jointly learn features and weights directly from distributional data in a log-linear framework. Specifically, we propose a non-parametric Bayesian model for learning phonological markedness constraints directly from the distribution of input-output mappings in an Optimality Theory (OT) setting. The model uses an Indian Buffet Process prior to learn the feature values used...

متن کامل

Similarity Avoidance and the Ocp

It has long been known that verbal roots containing homorganic consonant pairs are rare in Arabic, motivating the existence of an OCP-Place constraint (Obligatory Contour Principle on place of articulation) in the phonological grammar. We explore this constraint using an on-line lexicon of Arabic roots. The strength of the constraint is quantified by the ratio of the observed number of examples...

متن کامل

Can connectionist models of phonology assembly account for phonology?

Connectionist models have gained considerable success as accounts of how printed words are named. Their success challenges the view of grapheme-to-phoneme correspondences (GPCs) as rules. By extension, however, this challenge is sometimes interpreted also as evidence against linguistic rules and variables. This inference tacitly assumes that the generalizations inherent in reading (specifically...

متن کامل

Lexically Ranked Ocp-place Constraints in Muna

In this paper, we analyze the consonant co-occurrence restrictions in the Austronesian language Muna. As in Arabic and other languages, homorganic segments are underrepresented, particularly ones that are also similar in other ways. However, in Muna [voice] plays an unusually central role in this pattern. We analyze the Muna restrictions within Optimality Theory, using OCP-PLACE constraints rel...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012